Sinitic Wordnet: Laying the Groundwork with Chinese Varieties Written in Traditional Characters
نویسندگان
چکیده
The present work seeks to make the logographic nature of Chinese script a relevant research ground in wordnet studies. While wordnets are not so much about words as about the concepts represented in words, synset formation inevitably involves the use of orthographic and/or phonetic representations to serve as headword for a given concept. For wordnets of Chinese languages, if their synsets are mapped with each other, the connection from logographic forms to lexicalized concepts can be explored backwards to, for instance, help trace the development of cognates in different varieties of Chinese. The Sinitic Wordnet project is an attempt to construct such an integrated wordnet that aggregates three Chinese varieties that are widely spoken in Taiwan and all written in traditional Chinese characters.
منابع مشابه
Sinitic Wordnet: Laying the Groundwork with Chinese Varieties Written in Traditional Characters
The present work seeks to make the logographic nature of Chinese script a relevant research ground in wordnet studies. While wordnets are not so much about words as about the concepts represented in words, synset formation inevitably involves the use of orthographic and/or phonetic representations to serve as headword for a given concept. For wordnets of Chinese languages, if their synsets are ...
متن کاملSalmonella enterica serovar Enteritidis live vaccine strain in the reproductive organs of laying goose after subcutaneous vaccination
Serovar-specific real-time PCR for Salmonella enterica serovar Enteritidis (S. Enteritidis) was conductedto detect the genomic DNA of S. Enteritidis from laying goose after subcutaneous vaccination at differenttime points. Indirect fluorescent antibody (IFA) technique and immunohistochemical localization wereemployed to validate the results. The results showed that S. Enteritidis was consistent...
متن کاملStrategies of Processing Japanese Names and Character Variants in Traditional Chinese Text
This paper proposes an approach to identify word candidates that are not Traditional Chinese, including Japanese names (written in Japanese Kanji or Traditional Chinese characters) and word variants, when doing word segmentation on Traditional Chinese text. When handling personal names, a probability model concerning formats of names is introduced. We also propose a method to map Japanese Kanji...
متن کاملProcedures and Problems in Korean-Chinese-Japanese Wordnet with Shared Semantic Hierarchy
This paper introduces a Korean-Chinese-Japanese wordnet for nouns, verbs and adjectives. This wordnet is constructed based on a hierarchy of shared semantic categories originated from NTT Goidaikei (Hierarchical Lexical System). The Korean wordnet has been constructed by mapping a semantic category to each Korean word sense in a way that maps the same semantic hierarchy to the meanings of nouns...
متن کاملHantology-A Linguistic Resource for Chinese Language Processing and Studying
Hantology, a character-based Chinese language resource is created to provide an infrastructure for language processing and research on the writing system. Unlike alphabetic or syllabic writing systems, the ideographic writing system of Chinese poses both a challenge and an opportunity. The challenge is that a totally different resources structure must be created to represent and process speaker...
متن کامل